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Input file flhl49E6cons; Output File flhl4926tra 
Sequence length E818 

CCACGCGTCCGATTTATAGAAAGAAGCTATAGTCTGTCGGGATATCCAACACCAACAGGGCTTACTGAGAGCTCCATTT 

CTGGAAAGCCTTACAAGACTGAGGAATATCAGACTGCGAATCACCGGGAACGGTTCCTTGCAGCACAGAAGCAATCTC 

TCTCCCCATCTTCGCATATTCTAATGGCAAAACAAGTGGAAGAAAAGAGGAAGCATGACTGCAGATCAGATCAGTTCTC 

TTTGTGGATTATATTTTCAGTAAAATGTATGGATCTATCTTTTCCTTGTTCTTATATCTAGATCATGAGACTTGACTGA 

MA NYSHAAD VN . I . L Q 13 

GGCTGTATCCTTATCCTCCATCCATCT ATG GCG AAC TAT AGC CAT GCA GCT GAC .AAC ATT TTG CAA 39 

NLSPLTAFLKLTSLGF.-I IGV 33 

AAT CTC TCG CCT CTA ACA GCC TTT CTG AAA CTG ACT TCC TTG GGT TTC ATA ATA GGA GTC 99 

SVVGNLL I S I LLVKDKTLHR 53 

AGC GTG GTG GGC AAC CTC CTG ATC TCC ATT TTG CTA GTG AAA GAT AAG ACC TTG CAT AGA 159 

APYYFLLDLCCSDILRSAIC 73 

GCA CCT TAC TAC TTC CTG TTG GAT CTT TGC TGT TCA GAT ATC CTC AGA TCT GCA TTT TGT 219 

FPFVFNSVKNGSTWTYGTLT 93 

TTC CCA TTT GTG TTC AAC TCT GTC AAA AAT GGC TCT ACC TGG ACT TAT GGG ACT CTG ACT 279 

CKVIAFLGVLSCFHTAFMLF 113 

TGC AAA GTG ATT GCC TTT CTG GGG GTT TTG TCC TGT TTC CAC ACT GCT TTC ATG CTC TTC 339 

C I S V T R Y L A I A H H R F Y T K R L 133 

TGC ATC AGT GTC ACC AGA TAC TTA GCT ATC GCC CAT CAC CGC TTC TAT ACA AAG AGG CTG 399 

TFWTCLAVICMVWTLSVAMA 153 

ACC TTT TGG ACG TGT CTG GCT GTG ATC TGT ATG GTG TGG ACT CTG TCT GTG GCC ATG GCA 459 

FPPVLDVGTY SFIREEDQCT 173 

TTT CCC CCG GTT TTA GAC GTG GGC ACT TAC TCA TTC ATT AGG GAG GAA GAT CAA TGC ACC 519 

FQHRSFRANDSLGFMLLLAL 193 

TTC CAA CAC CGC TCC TTC AGG GCT AAT GAT TCC TTA GGA TTT ATG CTG CTT CTT GCT CTC 579 

I L L A T Q L V Y L K L I F F V H D R R 213 

ATC CTC CTA GCC ACA CAG CTT GTC TAC CTC AAG CTG ATA TTT TTC GTC CAC GAT CGA AGA 639 

KMKPVQFVAAVSQNWTFHGP 233 

AAA ATG AAG CCA GTC CAG TTT GTA GCA GCA GTC AGC CAG AAC TGG ACT TTT CAT GGT CCT 699 

GASGQAAANWLAGFGRGPTP 253 

GGA GCC AGT GGC CAG GCA GCT GCC AAT TGG CTA GCA GGA TTT GGA AGG GGT CCC ACA CCA 759 

P T L L G I R Q N A N T T G R R R L L V 273 

CCC ACC TTG CTG GGC ATC AGG CAA AAT GCA AAC ACC ACA GGC AGA AGA AGG CTA TTG GTC 819 

LDEFKMEKRISRMFYIMTFL 293 

TTA GAC GAG TTC AAA ATG GAG AAA AGA ATC AGC AGA ATG TTC TAT ATA ATG ACT TTT CTG 879 

FLTLWGPYLVACYWRVFARG 313 

TTT CTA ACC TTG TGG GGC CCC TAC CTG GTG GCC TGT TAT TGG AGA GTT TTT GCA AGA GGG 939 



TO FIG. 1B. 

FIG. 1A. 
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FROM FIG. 1A. 



PVVPGGFLTAAVVMSF'AQAG 333 
CCT GTA GTA CCA GGG GGA TTT CTA ACA GCT GCT GTC TGG ATG AGT TTT GCC CAA GCA GGA 999 

INPFVCIFSNRELRR CFS TT 353 
TTT CCC CCG GTT TTA GAC GTG GGC ACT TAC TCA TTC ATT AGG GAG GAA GAT CAA TGC ACC 1 059 

LLYCRKSRLPREPYC V I * 37 1 

CTT CTT TAC TGC AGA AAA TCC AGG TTA CCA AGG GAA CCT TAC TGT GTT ATA TGA 1113 

GGGAGCATCTGTAAATCTTTAGCCTTGTGAAAACTAACCTTCTCTGCTGAGCAATTGTGGCCCATAGCCATATTTTGAG 
AAGGAAATTCAAGAATGGAATCAGCAGTTTTAAGGATTTGGGCAACATTCTGCAGTCTTTGCAATAGTTCACCTATAATC 
CTATTTTAAATCTCAGAGTGATCCTGCTGACTGCCAGCAAAGGTTTGTAATTAAGAAGGGACTGAACCACTGCCCTAAG 
TTTCTTTATGTGGTCAAAAACTAGATAATGAAAGTAGCAGGTGCTAAGTATCAGTGCTAAATGCTCTGTATGTCACTAC 
ATATGAAAAAACATAAAAAACAATTAGCATTGGACATCTTAATAAATTAAGTTGACATGAGGTAAATGTGTTGATAAA 
AACTAATTTTAGAAGTTTGAAGACTTTAAAACATTTCATACTACTATTGTTTTGCAAAGACTAAAATATTTGGGGACTT 
AAAGTACTGTAATCCACTAAAGACGTGCCAATGAATTATTGGAATATCACACTTTAAAAACCGCCTTGTAAGTTCTGGG 
GAGCATTCCAAAGCAGTATATTGGTTCCAATTAGAGTTTACTTTTTTTGTATTAATACATTGCTATTTCTAAATACCAC 
TTTCCTCATCTACTAGTAAGATTGCTAGCATTGAACTGTATTATGTGGTTTTTGTTGATTTGGTATAAAGTTTTTCCAA 
TTCATTTATATTTTACAAATGCTAGATATTGGTCTGGGAGGCAACATTAATGGTACCAGCCTGTCACAACTGAGCAGTT 
CTAATAATGCAGAATAAACACATGTTGCCTTAAAGGGTTATCTAGKATCCYTTCATCTTATTAGCACTGGAGCAAATAG 
YCAAGGGAAATCRAATCAGTAACTGGTCATGGTCATGCATCTRAAAGTGCATGGAAGATCATTTAGTACTTTTTCCTTT 
TTTCTCACATGGTTTGAAACTTAAAGTGCACATCMCTGAAATAATGAGATTTTCTTTTRMGGTGTGCTACCCTTYYTAR 
ASTGTTCTAAGAAGCAGGCAGTTGATGTATGTTTATATTTTAAGTCAGCTGTCGAGGGGAGACCACAGCCCTTAGTATGA 
CATCCTGCACAATTTGTGAAGCATTTATTCTACTGAAGGCACAGTCTTGTTTATACTTTCTGCACATTCAGTGTATTGG 
TCATTTAAATTATTTCAGTTTTAACTTGTGAAAGCTTATAATATGATTTCTGGTATTTTAGAAATACATTAGAGTCTGT 
GAGTCTCATTCTTTAAGATACANATGTGTGAACTTCAATATAAAGTTGCATTTGCCAAAATTTAAAAAAAAAAAAAAAA 
AAAAAAAAAAA 



FIG. 1B. 
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Protein Family/Domain HMM Matches for f1h14926orfaa 

> PF00001/ 7th 1 7 transnenbrane receptor (rhodopsin fanily) 

Score: 146.01 Seqs 37 339 Hodeh 1 269 

■GN i LVIWvI cRy RRMRTPMNYF I vNL Av ADLLFs I ftMPFWMvYyvMqg 
GN+L+ +++++ + +++ ++YF++ L +D+L S ++PF + + ++ 
flhl4926or 37 GNLL I S I LLVKDKTLHRAP YYFLLDLCCSD ILRSA I CFPFVFNSVKNGS 85 

R WpFGdf MCr I WnYFD YMNMYAS I Ff LTc I S I DRYLW A I CHPMr YnRWMT 
+W++G++ C+++ ++ +++++ F+L CIS+ RYL AI H Y + T 
flhl4926or 86 T WTYGTLTCKV I AFLGVLSCFHT AFMLFC I SVIRYL- A I AHHRF YTKRLT 134 

pRHRAWvMI i i IWvMSFlISMPPFLMFr. VstyrDEneVNnTWCnlyDWP 
+ ++++I+++W++S+++++PP+L + +S+ R E++ C++ + 
flhl4926or 135 FW-TCLAV I CM VWTLS VAMAFPP VLD VGT YSF I REEDQ CTF-Q— 175 

ewMWrWYvILnti i noFY I PM i I M I FCYwR I YR I aR I WMRM IpswQr . . . 
+R++ +GF++ + ++L ++Y + ++ + ++ 

flhl4926or 176 — HRSFR-ANDS-LGFMLLLAL I LL ATQL VYLKL I FFVHDR — RKMKP 217 



flhl4926or 218 VQFV A AVSQNWTFHGPGASGQAA ANWLAGFGRGPTPPTLLG I RQNANTTG 267 



flhl4926or 268 



RRrns. . . . nRrERR i vKM I i 1 1 MvVFI I CW IPYFI vnf MDTLMMwwFCe 

RRR+ +++E+RI++M 1+ ++F+ W+PY ++ + +++F 

RRRLL VLDEFKMEKR I SRMFY I MTFLFLTLWGPYLVACY WRVFAR 3 1 2 



fC. IwrrlWnYIfeWLaYvNCpClNPIIY* 
++ + +++W++++ INP++ 
flhl4926or 313 GPVVPGGFLT-AAVWMSFAQA-GINPFVC 



339 



> MILPAT00028/ nQf NGF / BNDF / Neurotroph I ns 3,4, and 6 family of cytokines 

Scores 0.47 Seq: 290 302 Model* 1 13 
REF xxxxxxxxxxxxx 
xMSMLFYTMFIsYF* 
M+ LF+T+ +Y+ 
flhl4926or 290 MTFLFLTLWGPYL 302 

FIG. 2. 
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Analysis of flh14926orfaa (371 aa) 



Cys 

Ngly 
out 
TM 
ins 




I I III II 
I I I 

V7777i V7777ZZZ1 

VSSSSSA VSSSSSA V77777i V//S/A VSSSSSSA 

4.8 ra 1.2 4.3^4.9 3.7 ^ 





V777771 VS/SSSA 

1 4.6 2.3^^ 



I i I i I i I i I i I i I i I i I i I i I i I i I i I i I i I i I i I i I i 
1 41 81 121 161 201 241 281 321 361 



>flhl4926orfQQ 

MANYSHAADN I LQNLSPLT AFLKLTSLGF I I G VSV VGNLL I S I LLVKDKTLHR AP YYFLL 
DLCCSD ILRSA I CFPFVFNSVKNGSTVTYGTLTCKV I AFLGVLSCFHTAFMLFC I SVTRY 
LA I AHHRFYTKRLTFWTCLAV I CM VWTLSVAMAFPP VLDVGTYSF I REEDQCTFQHRSFR 
ANDSLGFMLLLAL I LLATQLV YLKL IFFVHDRRKMKPVQF VAAVSQNWTFHGPGASGQAA 
ANWLAGFGRGPTPPTLLGIRQNANTTGRRRLLVLDEFKMEKRISRMFYIMTFLFLTLWGP 
YL V AC YWRVF ARGP V VPGGFLTAA VWMSFAQ AG I NPFVC I FSNRELRRCF STTLLYCRKS 
RLPREPYCVH 



FIG. 4. 
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It prositescan will scan one or more sequences against a set of sequence patterns 

Database: Release 12,2 of February 1995 
Tue Apr 7 18: 49: 19 1998 
1025 patterns 

Query= f lhl4926orfaa 

> PS0000 1 /PDDCOOOO 1 /ASN_GL YCDS YL AT I DN N-glycosylation site. 
N[ A *P][ST][ A P] 

Query: 3 NYSH 6 
Query: 83 NGST 86 
Query: 182 NDSL 185 
Query:227 NWTF 230 
Query:264 NTTG 267 

>PS00004/PDDC00004/CAMP_PHDSPHQ_SITE cAMP- and cGMP-dependent protein kinase 
phosphorylation site. 
[RKH2HA-ZHST] 



Query: 131 KRLT 134 
Query:281KRIS 284 

>PS00005/PD0C00005/PKC_PH0TPHD_SITE Protein kinase C phosphorylation site. 
[STHA-ZHRK] 

Query: 80 SVK 82 
Query: 93 TCK 95 

Query: 130 TKR 132 
Query: 178 SFR 180 
Query: 266 TGR 268 
Query: 342 SNR 344 

>PS00006/PD0C00006/CK2 PHOSPHO SITE Casein kinase II phosphorylation site. 
[STHA-ZH2HDE] 

Query: 342 SNRE 345 

> PS00008/PDDC00008/M YR I STYL N-iwristoylation site. 

G[ A *EDRKHPFYW][A-Z][2][STAGCN][ A xPI - _ _ 

D fi^TUTYfi" FIG. 5- 

Query: 84 GSTwTY 89 



